3574 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
LDC
Size:
26 GByte
Production Status:
Existing-used
Use:
Language Modelling
- Paper title: Literal and Metaphorical Senses in Compositional Distributional Semantic Models
- Paper track: Empirical/Data-Driven
- Paper status: Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | E. Dario Gutierrez | University of California, San Diego | N/A |
| Author 2 | Ekaterina Shutova | University of Cambridge | N/A |
| Author 3 | Tyler Marghetis | University of Indiana | N/A |
| Author 4 | Benjamin Bergen | University of California, San Diego | N/A |
| Main Contact | E. Dario Gutierrez | University of California, San Diego | None |
Documentation:
Publicly available in English on website
Written
Ontology,
Language Type:
Multilingual
Languages:
English
Availability:
will be published on GitHub together with the extended version of the paper if accepted
License:
MIT
Size:
1 MByte
Production Status:
Newly created-in progress
Use:
Knowledge Discovery/Representation
- Paper title: A Regional News Corpora for Contextualized Entity Discovery and Linking
- Paper track: Evaluation
- Paper status: Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Adrian Brasoveanu | MODUL University Vienna | AT |
| Author 2 | Lyndon J.B. Nixon | MODUL University Vienna | AT |
| Author 3 | Albert Weichselbraun | HTW Chur | CH |
| Author 4 | Arno Scharl | MODUL University Vienna | AT |
| Main Contact | Adrian Brasoveanu | MODUL University Vienna | None |
Documentation:
English
Written
Corpus,
Language Type:
Trilingual
Languages:
English Spanish French
Availability:
Freely Available
License:
Creative Commons by-nc-sa
Size:
132 <Not Specified>
Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: An Analysis (and an Annotated Corpus) of User Responses to Machine Translation Output
- Paper track: Evaluation
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Daniele Pighin | Universitat Politècnica de Catalunya | None |
| Author 2 | Lluís Màrquez | Universitat Politècnica de Catalunya | None |
| Author 3 | Jonathan May | <Not Specified> | None |
| Main Contact | Daniele Pighin | Universitat Politècnica de Catalunya | ES |
Documentation:
English, inside the package
Written
Corpus,
Language Type:
Multilingual
Languages:
English German
Availability:
Freely Available
License:
Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International
Size:
4 MByte
Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: Multimodal Pivots for Image Caption Translation
- Paper track: Empirical/Data-Driven
- Paper status: Accept - Outstanding
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Julian Hitschler | Computational Linguistics, University of Heidelberg | DE |
| Author 2 | Shigehiko Schamoni | Heidelberg University | DE |
| Author 3 | Stefan Riezler | Heidelberg University | DE |
| Main Contact | Stefan Riezler | Heidelberg University | None |
Documentation:
http://www.statmt.org/wmt16/multimodal-task.html
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
From Data Center(s)
License:
Creative Commons (CC-BY-SA 3.0)
Size:
2.6 GByte
Production Status:
Newly created-finished
Use:
Speech Recognition/Understanding
- Paper title: Free English and Czech telephone speech corpus shared under the CC-BY-SA 3.0 license
- Paper track: Speech
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Matěj Korvas | Charles University in Prague, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics | CZ |
| Author 2 | Ondřej Plátek | Charles University in Prague, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics | CZ |
| Author 3 | Ondřej Dušek | Charles University in Prague | CZ |
| Author 4 | Lukáš Žilka | Charles University in Prague, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics | CZ |
| Author 5 | Filip Jurčíček | Charles University in Prague, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics | CZ |
| Main Contact | Matěj Korvas | Charles University in Prague, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics | None |
Documentation:
English documentation is included in README files.
Written
Corpus,
Language Type:
Multilingual
Languages:
English
Availability:
Currently confirming (because our language resource is based on OntoNotes Release 5.0 (LDC2013T19), we are confirming the appropriate license for our resource with LDC)
License:
Currently confirming (please see the above description in 'Resource Availability')
Size:
37000 sentences
Production Status:
Newly created-finished
Use:
Parsing and Tagging
- Paper title: Construction of an English Dependency Corpus incorporating Compound Function Words
- Paper track: Written
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Akihiko Kato | Nara Institute of Science and Technology | JP |
| Author 2 | Hiroyuki Shindo | Nara Institute of Science and Technology | JP |
| Author 3 | Yuji Matsumoto | Nara Institute of Science and Technology | JP |
| Main Contact | Akihiko Kato | Nara Institute of Science and Technology | None |
Documentation:
Readme in English is available at the above URL.
Language Type:
Multilingual
Languages:
English Spanish
Availability:
Freely Available
License:
Creative Commons Attribution - Non Commercial - Share Alike 3.0
Size:
11,292 relative and absolute quality assessments (Other)
Production Status:
Newly created-in progress
Use:
Machine Translation, SpeechToSpeech Translation
- Paper title: The FAUST Corpus of Adequacy Assessments for Real-World Machine Translation Output
- Paper track: Written
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Daniele Pighin | Universitat Politècnica de Catalunya | None |
| Author 2 | Lluís Màrquez | Universitat Politècnica de Catalunya | None |
| Author 3 | Lluís Formiga | Universitat Politècnica de Catalunya | None |
| Main Contact | Daniele Pighin | Universitat Politècnica de Catalunya | ES |
Documentation:
Within the package
Sign Language
Corpus,
Language Type:
Multilingual
Languages:
Australian Sign Language English
Availability:
Endangered Languages Archive, SOAS, London
License:
<Not Specified>
Size:
>1000 hours
Production Status:
Archived but constantly updated as more annotations are made.
Use:
Empirical language description (grammar, dictionaries), language teaching
- Paper title: Mouth-based non-manual coding schema used in the Auslan corpus: explanation, application and preliminary results
- Paper track: Poster
- Paper status: Accept as poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Trevor Johnston | Macquarie University | AU |
| Author 2 | Jane van Roekel | Macquarie University | None |
| Main Contact | Trevor Johnston | Macquarie University | None |
Documentation:
<Not Specified>
<Not Specified>
Word Embeddings,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
<Not Specified>
Size:
285055 words
Production Status:
Newly created-finished
Use:
Document Classification, Text categorisation
- Paper title: Evaluation of Domain-specific Word Embeddings using Knowledge Resources
- Paper track: Evaluation
- Paper status: Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Farhad Nooralahzadeh | University of Oslo | NO |
| Author 2 | Lilja Øvrelid | Dept of Informatics, University of Oslo | NO |
| Author 3 | Jan Tore Lønning | University of Oslo | NO |
| Main Contact | Farhad Nooralahzadeh | University of Oslo | None |
Documentation:
<Not Specified>
Modality Independent
Tagger/Parser,
Language Type:
Multilingual
Languages:
English
Availability:
Freely Available
License:
open source
Size:
<Not Specified>
Production Status:
Existing-used
Use:
Morphosyntactic annotation
- Paper title: Building comparable corpora from social networks
- Paper track: <Not Specified>
- Paper status: Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Marwa Trabelsi | Laboratoire LIPAH, Faculté des sciences de Tunis, Département des Sciences de l’informatique 1060 Tunis, Tunisie | TN |
| Author 2 | Malek Hajjem | Laboratoire de recherche LISI, INSAT Tunis Carthage | TN |
| Author 3 | Chiraz Latiri | Laboratoire LIPAH, Faculté des sciences de Tunis, Département des Sciences de l’informatique 1060 Tunis, Tunisie | TN |
| Main Contact | Marwa Trabelsi | Laboratoire LIPAH, Faculté des sciences de Tunis, Département des Sciences de l’informatique 1060 Tunis, Tunisie | None |
Documentation:
Yes, in English




